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ABSTRACT:  Under  acidic  conditions  (50  equiv  of  TFA),  combinations  of 
hydrazide  A-B  monomers  self- assemble  into  octameric  [2] -catenanes  with  high 
selectivity  for  [  132]  2,  where  1  is  a  D-Pro-X  (X  =  Aib,  Ac4c,  Ac6c,  L-4-Cl-PhGly)- 
derived  monomer  and  2  is  an  L-Pro'-L-arylGly  (Pro'  =  Pro,  trans- F-Pro,  trans-HO- 
Pro,  aryl  =  naphthyl,  phenyl) -derived  monomer.  Five  different  combinations  of 
monomers  were  studied  by  X-ray  crystallography.  In  each  case,  the  unique  aryl 
glycine  unit  is  located  in  the  core  of  the  structure  where  the  aryl  ring  templates  a 
CH— n—  CH  sandwich.  Analysis  of  metrical  parameters  indicates  that  this  core 
region  is  highly  conserved,  while  the  more  peripheral  zones  are  flexible.  XH  NMR  spectroscopy  indicate  that  the  solid-state 
structures  are  largely  retained  in  solution,  though  several  non-C2-symmetric  compounds  have  a  net  C2-symmetry  that  indicates 
accessible  dynamic  processes.  Catenane  dynamic  processes  were  additionally  probed  through  H/D  exchange,  with  the  core  being 
inflexible  relative  to  the  peripheral  structure.  Mass  spectrometry  was  utilized  to  identify  the  constitutional  isomerism  in  the  minor 
asymmetric  [l523]  catenanes. 


■  INTRODUCTION 

The  self-assembly  of  complex  structures  from  simple  building 
blocks  is  a  general  principle  that  can  lead  to  significant  gains  in 
structure  and  function.1  As  exemplified  by  the  protein  folding 
problem,2  global  minima  are  achieved  through  the  interplay  of 
many  weak  interactions,  the  quantitation  of  which  is  often 
challenging.  Within  this  context,  it  is  the  balance  of  non- 
covalent  forces  (H-bonding,  electrostatic,  n— n,  cation —jt, 
CH —n,  and  hydrophobic  effects)  that  dictate  thermodynamic 
minima.  Model  systems  have  been  widely  utilized  to  investigate 
these  contributing  non-covalent  interactions  in  relative 
isolation,  with  the  ease  of  manipulation  and  characterization 
enabling  significant  insight  into  the  nature  and  driving  force  of 
such  interactions.3  A  key  limitation,  however,  is  that  few  such 
model  systems  simulate  the  cooperative  folding  and  binding 
found  in  proteins.4 

We  report  herein  structural  studies  of  a  synthetic  self¬ 
assembling  system  that  bridges  the  complexity/manipulability 
gap  that  exists  between  small-molecule  model  systems  and 
complex  biosystem  such  as  proteins.  The  self-assembled 
structure  is  generated  from  dipeptide  hydrazide  monomers 
into  discrete  octameric  [2] -catenanes  of  molecular  weight  >3 
kDa.  A  structure  analysis  reveals  that  the  compounds  rely  on  H- 
bonding,  n— n,  and  CH —n  interactions  along  with  biogenic  /?- 
turns  to  achieve  their  minimum-energy  structures.  The 
combination  of  large  size,  complexity,  and  structural  manipu- 
lability  provides  the  opportunity  to  gauge  how  the  balance  of 
forces  influences  the  energetics  of  this  self-assembled  structure, 
and  thus  bridges  the  gap  between  small  molecules  and  proteins. 

Templated  synthesis  is  a  long-established  strategy  for 
accessing  entropically  disfavored  compounds  and  can  provide 
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more  complex  recognition  systems  than  designed  host— guest 
pairs.  A  dynamic  variant  has  been  recently  developed  by 
Sanders,  Lehn,  and  others  as  a  means  of  discovering  tight- 
binding  hosts  for  a  broad  variety  of  guests.5  Termed  dynamic 
combinatorial  or  constitutional  chemistry  (DCC),  this  strategy 
enables  molecular  receptors  to  be  selected  from  dynamic 
libraries  of  potential  hosts  without  prior  knowledge  of  the 
guest's  structure.  By  its  very  nature,  however,  the  competitive 
selection  masks  the  recognition  elements  that  create  the 
equilibrium  host— guest  structure.  While  much  is  known  about 
the  constitution  and  binding  properties  of  these  host  and  host- 
guest  assemblies,  little  is  known  about  their  molecular 
structures. 

In  the  course  of  our  own  experiments  examining  complex 
libraries  assembled  from  mixtures  of  dipeptide  monomers,6  we 
noted  that  one  specific  combination  of  two  monomers  led  to 
the  self-assembly  of  a  [2] -catenane  composed  of  56-membered 
macrocyclic  tetramers,  each  of  which  was  composed  of  a  3:1 
mixture  of  the  two  monomers  (Figure  l).7  While  this  [2]- 
catenane  recognizes  itself,  Sanders  has  previously  demonstrated 
that  catenated  dipeptide  trimers  will  bind  quat-salts  like 
acetylcholine  with  sub-micromolar  affinities,  the  host— guest 
complex  of  which  could  be  characterized  by  NMR8  The  lack  of 
solid-state  structural  data  on  these  and  the  other  receptors 
assembled  from  peptide  monomers8-10  prompted  us  to  utilize 
X-ray  methods  in  conjunction  with  solution-phase  studies  to 
structurally  characterize  a  number  of  additional  [2] -catenanes. 
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Figure  1.  Cartoon  showing  the  flower-like  structure  of  the  [2]- 
catenane  composed  of  two  interlocked  identical  tetramers. 


Our  communicated  X-ray  structure  of  [2]-catenane  H3a 
revealed  that  the  presence  of  four  /Tturns  in  each  tetramer 
results  in  collapse  of  the  large  macrocycle  into  a  conformer  that 
converges  functionality  and  enables  self-recognition  (Figure 
l).7  Inter-  and  intra-macro  cycle  hydrogen  bonds,  n— n  stacking, 
and  aliphatic  CH —n  interactions  all  appear  to  contribute  to  the 
high-fidelity  recognition  of  one  macrocycle  by  the  other.  In 
addition  to  being  physically  interlocked,  two  /Lturns  were 
found  to  create  a  clip  that  binds  the  aromatic  portion  of  the  C2- 
related  macrocycle,  using  the  C— H  bonds  of  one  proline  ring 
and  the  C— H  bonds  of  one  Aib  methyl  substituent  (Aib  =  1- 
aminoisobutyric  acid,  or  a-methyl-alanine)  as  the  teeth  of  the 
clip,  i.e.,  a  CH— n—  CH  sandwich.  Due  to  the  C2-nature  of  the 
catenane,  this  binding  interaction  is  repeated  in  the  symmetry- 
equivalent  position.  Because  the  catenane  was  found  to  form 
only  when  one  monomer  contained  the  phenylglycine  or 
naphthylglycine  side  chain  that  forms  the  CH— n—  CH 
sandwich,  it  appeared  that  this  was  a  critical  element  required 
for  self-templating  of  the  catenane. 

To  gain  further  insight  into  the  relative  contribution  of  each 
of  these  non-covalent  interactions11-13  to  the  tightly  con¬ 
strained  self-complementary  structure,  we  designed  a  series  of 
mutated  [2]-catenanes  and  investigated  their  structure/stability 
relationship.  This  broad  study  is  reported  herein  in  two  parts. 
The  first  focuses  on  structure,  principally  utilizing  X-ray 
methods  but  also  supported  by  solution  characterization 
tools,  including  high-resolution  NMR  spectroscopy  to  probe 
the  similarity  of  solid-state  and  solution  structures,  and 
solution-phase  H/D  exchange  to  probe  structural  dynamics 
and  solvent  accessibility.  These  tools  are  complemented  by  an 
in-depth  mass  spectrometry  fragmentation  analysis,  which 
enables  constitutional  isomerism  to  be  deconvoluted.  In  this 
first  paper,  we  address  the  following  questions:  What 
interactions  are  highly  conserved?  Are  there  parts  of  the 
structure  that  can  accommodate  variation?  Are  the  features 
observed  in  the  solid  state  maintained  in  solution?  Each  of 
these  questions  is  addressed  with  regard  to  the  /Lturns, 
hydrogen  bonds,  and  n—n  stacking  interactions  in  the  “aryl 
barrel”  and  the  CH— n—  CH  sandwich.  Lastly,  the  solid-state 
and  solution-phase  structures  of  two  non-symmetric  catenanes 
made  up  of  two  different  rings  are  investigated  and  compared 
to  the  C2  symmetric  catenanes. 

The  second  paper  in  this  series14  utilizes  mutation 
experiments  to  determine  the  relative  contribution  of  each  of 
the  non-covalent  interactions  to  the  structure  and  the  roles  of 


multivalency  and  cooperativity  in  folding  and  molecular 
recognition  in  these  catenanes. 

These  studies  provide  insight  into  the  effect  of  a  broad 
variety  of  multivalent  non-covalent  interactions  on  the  stability 
of  a  self-assembled  [2] -catenane  in  organic  solvents.  The 
intermediate  size  (>3k  Da)  and  complexity  of  the  [2] -catenane 
help  to  bridge  the  gap  between  small-molecule  models  and 
proteins  without  sacrificing  the  ability  to  precisely  tune 
structure,  obtain  relative  stability  data,  and  structurally 
characterize  with  high-resolution  methods  in  solution,  in  the 
solid  state,  and  in  the  gas  phase  (MS). 

■  EXPERIMENTAL  SECTION 

General  Methods.  Chemicals  were  purchased  from  Aldrich,  Fisher 
Scientific,  and  Chem-Impex  International,  Inc.  and  used  as  received. 
*H  and  13C  NMR  spectra  of  monomers  and  *H,  13C,  gradient  TOCSY, 
ROESY,  and  HSQC  NMR  spectra  of  [2] -catenanes  (F3b,  ho3c,  F3dph, 
F4b)  were  recorded  on  a  Bruker  DRX  500  spectrometer  or  a  Bruker 
600  Cryoprobe  spectrometer  and  processed  using  TOPSPIN  Bruker 
NMR  software  (V.  3.0).  High-resolution  mass  spectra  were  obtained 
on  an  Agilent  Accurate  LC-TOF  mass  spectrometer  (Agilent  Series 
6220)  operating  in  positive-ion  mode  with  electrospray  ionization 
source  (fragmentor  =  175  V).  The  data  were  analyzed  using  Agilent 
MassHunter  Workstation  Software,  Qualitative  Analysis  (V.  B.02.00). 
HPLC  analysis  was  performed  on  a  Hewlett-Packard  Series  1100 
instrument,  using  a  Halo-C18  column  (4.6  X  150  mm,  2.7  //m)  with 
gradient  elution  (methanol/water)  at  a  flow  rate  of  0.45  mL/min  and 
at  55  °C.  The  injection  volume  for  a  5  mM  dynamic  self-assembly 
(DSA)  reaction  was  typically  3.5  //L.  UV  absorbance  chromatograms 
were  recorded  at  wavelengths  of  220  and  289  nm.  The  data  were 
analyzed  using  Agilent  Chemstation.  Accurate  Mass  LC-TOF  analyses 
of  DSA  reactions  were  performed  on  an  Agilent  Series  1200  LC 
instrument,  using  a  Halo-C18  column  (2.1  X  50  mm,  2.7  // m)  with 
gradient  elution  (methanol/ water  containing  0.1%  formic  acid)  at  a 
flow  rate  of  0.5  mL/min  and  at  50  °C.  The  eluent  was  analyzed  by  an 
Accurate  Mass  LC/MS  TOF  mass  spectrometer  (Agilent  Series  6220) 
in  positive-ion  mode  with  an  electrospray  ionization  source 
(fragmentor  =  375V).  The  data  were  analyzed  using  Agilent 
MassHunter  Workstation  Software,  Qualitative  Analysis  (V. 
B.02.00).  The  isolation  of  [2] -catenanes  was  performed  on  a  modified 
semi-preparative  HPLC  (Agilent  Series  1200  LC  instrument),  using  an 
Agilent  Zorbax  Eclipse  XDB-C18  PrepHT  Cartridge  column  (21.2  X 
250  mm,  7  fim)  with  isocratic  methanol/water  elution  (no  additive)  at 
a  flow  rate  of  9  mL/min  and  at  55  °C.  The  eluent  was  monitored  by 
UV  absorbance  chromatogram  at  a  wavelength  of  289  nm,  and  the 
corresponding  [2] -catenanes  were  collected  with  a  fraction  collector. 
MS/MS  analyses  of  the  isolated  [2] -catenanes  and  DSA  reactions  were 
performed  on  an  Accurate  Mass  LC-QTOF  (Agilent  Series  6520) 
equipped  with  an  Agilent  Series  1200  LC  instrument  without/with  a 
column  (Halo-C18,  2.1  X  50  mm,  2.7  //m)  in  positive-ion  mode  with 
an  electrospray  ionization  source.  X-ray  crystallographic  analyses  of 
[2] -catenanes  F3dph,  ho3c,  F3b,  and  F4b  were  performed  on  a  Bruker- 
AXS  SMART  APEX-II  system  equipped  with  a  graphite  mono¬ 
chromator.  [Please  contact  Dr.  Peter  S.  White  (pwhite(a)email.unc. 
edu)  for  correspondence  regarding  X-ray  analyses.] 

Generation  and  Isolation  of  F3dph,  H03c,  F3b,  and  F4b.  For 
F3dph,  two  5  mM  DSA  solutions  (D-ld/L,L-F2ph  =1:1)  were  prepared 
on  a  20  mL  scale.  Each  was  prepared  by  dissolving  the  mixture  of  D-ld 
(20.2  mg,  50.0  //mol)  and  L,L-F2ph  (22.9  mg,  50.0  //mol)  in  chloroform 
(20  mL)  and  subsequently  adding  trifluoroacetic  acid  (50  equiv,  5000 
//mol,  371  // L).  The  solutions  were  allowed  to  sit  for  6  days,  and  then 
triethylamine  (50  equiv,  5000  //mol,  0.7  mL)  was  added  to  quench  the 
dynamics.  To  remove  the  precipitates  generated  during  the  DSA 
process  due  to  the  poor  solubility  of  other  oligomers  in  chloroform 
and  after  the  addition  of  triethylamine,  the  solutions  were  filtered,  and 
the  two  were  combined.  The  volatiles  were  removed  in  vacuo,  and 
chloroform  (50  mL)  was  added.  This  solution  was  washed  with  water 
(40  mL)  and  then  dried  over  anhydrous  MgS04.  More  chloroform  (80 
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mL)  was  added;  and  the  solution  was  heated  to  50  °C  (to  mitigate  the 
poor  solubility  of  F3dph  in  CHC13)  and  then  filtered  to  remove 
MgS04.  The  volatiles  were  removed  in  vacuo  again,  and  the  purity  of 
the  crude  product  was  determined  by  HPLC.  Crude  F3dph  contained 
4%  F4dph,  and  without  further  purification,  it  was  recrystallized  from 
pyridine  by  slow  evaporation  at  room  temperature  to  generate  crystals 
suitable  for  X-ray  analysis.  For  ho3c,  two  biased  monomer  mixtures 
(d-1c/l,l-ho2  =  3:l)  were  prepared  on  a  20  mL  scale.  Each  was 
prepared  by  dissolving  d-1c  (35.6  mg,  75.0  //mol)  and  l,l-ho2  (12.7 
mg,  25.0  //mol)  in  25%  acetonitrile /chloroform  (20  mL),  to  which  was 
subsequently  added  trifluoroacetic  acid  (50  equiv,  5000  //mol,  371 
//L).  The  reactions  were  allowed  to  sit  for  3  days,  and  then 
triethylamine  (50  equiv,  5000  //mol,  0.7  mL)  was  added  to  neutralize 
the  acid.  The  solutions  were  combined,  and  the  volatiles  were  removed 
in  vacuo.  Methanol  (8  mL)  was  added  to  the  resulting  mixture,  which 
was  then  filtered.  This  solution  was  eluted  on  the  semi-preparative 
HPLC  with  87%  methanol/ water  isocratic  solution  to  isolate  ho3c. 
The  collected  ho3c  was  recrystallized  from  methanol/chloroform  by 
slow  evaporation  at  room  temperature  to  obtain  crystals  suitable  for  X- 
ray  analysis.  For  F3b  and  F4b,  three  DSA  solutions  (D-lb/L,L-F2  =1:1) 
were  prepared  on  a  20  mL  scale.  Each  was  obtained  by  dissolving  a 
mixture  of  D-lb  (27.0  mg,  50.0  //mol)  and  l,l-f2  (19.1  mg,  50.0  //mol) 
in  25%  acetonitrile/chloroform  (20  mL)  and  adding  trifluoroacetic 
acid  (50  equiv,  5000  //mol,  371  //L).  The  reactions  were  allowed  to  sit 
for  9—10  days,  and  then  triethylamine  (50  equiv,  5000  //mol,  0.7  mL) 
was  added  to  quench  the  reaction.  All  solutions  were  combined,  and 
the  volatiles  were  removed  in  vacuo.  Next,  65%  methanol/chloroform 
(15  mL)  was  added  to  the  resulting  mixture,  which  was  then  filtered 
prior  to  semi-preparative  HPLC  separation  (95%  isocratic  methanol/ 
water  elution).  The  collected  F3b  and  F4b  were  recrystallized  from 
methanol/chloroform  by  slow  evaporation  at  room  temperature  to 
obtain  crystals  suitable  for  X-ray  analysis. 

MS/MS  Analysis  of  Isolated  F4b  and  DSA  Reaction  Mixtures 
Containing  F4b  and  H4a.  For  the  isolated  F4b,  the  sample  was 
prepared  by  first  dissolving  in  methanol  and  then  eluting  with  an 
isocratic  eluent  (50%  methanol/ water  containing  0.1%  formic  acid)  at 
a  flow  rate  of  0.2  mL/min  for  2  min.  Injection  volume  was  1  // L.  The 
MS  source  nebulizer  gas  and  Vcap  parameters  were  set  to  35  psig  and 
3500  V,  respectively.  The  fragmentor  parameter  was  adjusted  to  either 
175.0  or  375.0  V,  depending  on  the  experiments.  The  other  MS  source 
parameters  were  default  settings.  The  ramped  collision  energy  was 
adjusted  by  slope  (typically  2)  and  offset  (0  to  30)  values.  The  MSI 
and  MS2  data  were  collected  in  the  range  of  100—3200  ( m/z )  with  a 
reference  ion  (m/z  =  922.009798).  With  a  175.0  V  fragmentor  setting, 
the  targeted  MSI  ion  was  the  abundant  doubly  charged  ion  [ F4b  + 
2H]2+  (m/z  =  1588.7219)  under  ESI  conditions.  In  the  experiment 
with  a  375.0  V  fragmentor  setting,  the  targeted  MSI  ion  was  the  singly 
charged  ion  [lb2F22  +  H]+  (m/z  =  1626.7100).  For  DSA  reaction 
mixtures  containing  F4b  and  H4a,  0.1  mL  of  a  DSA  solution  was 
diluted  with  0.1  mL  of  25%  acetonitrile/chloroform  solution  prior  to 
the  injection,  and  then  the  mixture  was  eluted  with  a  gradient  profile 
(methanol/water  containing  0.1%  formic  acid)  at  a  flow  rate  of  0.5 
mL/min  and  at  50  °C.  The  injection  volume  was  typically  2  //L.  MS 
source  parameters,  the  ramped  collision  energy,  and  the  MSI  and  MS2 
data  collection  ranges  were  set  in  the  same  way.  For  the  DSA  solutions 
generating  F4b,  the  fragmentor  parameters  and  targeted  MSI  ions 
were  the  same  as  those  for  the  isolated  F4b.  For  the  DSA  solutions 
providing  H4a,  175.0  and  420.0  V  were  chosen  as  the  fragmentor 
settings.  At  175.0  V,  the  targeted  MSI  ion  was  the  doubly  charged 
[H4a  +  2H]2+  (m/z  =  1461.1533),  while  at  420.0  V,  the  targeted  MSI 
ion  was  the  singly  charged  [la2H22  +  H]+  (m/z  =  1510.6581). 

■  RESULTS 

1.  Self-Assembly  and  Isolation  of  [2]-Catenanes.  As 

shown  in  Scheme  1,  the  combination  of  the  D-Pro-X  monomer 
[(d-  or  d,l-1x;  X  =  Aib,  1-aminocyclobutanecarboxylic  acid 
(Ac4c),  1- amino  cyclohexane  carboxylic  acid  (Ac6c),  l-(4- 
chlorophenyl) glycine  (L-4-Cl-PhGly)]  and  L-yPro-L-arylGly 
[l,l-72  or  y2p  ;  y  =  H,  F  or  OH;  aryl  =  Naph  or  Ph, 


Scheme  1.  [2]-Catenanes  Self- Assembled  from  Pairs  of 
Hydrazide/Acetal  Monomers 


respectively]  under  the  acidic  conditions  necessary  for 
promoting  hydrazone  exchange  yields  a  variety  of  cyclic 
oligomers,  with  dimers,  trimers,  and  tetramers  predominating 
(traces  of  pentamers  and  hexamers  are  also  detected  by  HPLC 
and  LC-TOF  analyses)  (SI1  Figure  l). 

Also  appearing  under  these  dynamic  self-assembly  (DSA) 
reaction  conditions  are  significantly  less  polar  species  that 
steadily  grow  to  a  maximum  after  3—5  days.  LC-TOF  analyses 
showed  them  to  be  catenated  octamers  as  judged  by  the 
octamer  to  tetramer  fragmentation  pattern,  which  is  character¬ 
istic  of  an  interlocked  structure  (vide  infra)  (SI1  Figures  2— 
4). 15  Two  [2]-catenanes  dominated,  a  cyclic  octamer  (3)  made 
up  of  two  rings  having  identical  mass,  i.e.  [l321]2^  and  an 
asymmetric  octamer  (4)  composed  of  interlocked  [132^  and 
[l222]  rings  (Figure  2). 

In  some  cases,  other  octamers  having  a  mass  of  [l424]  as  well 
as  constitutional  isomers  of  both  3  and  4  were  observed  in  low 
quantities  (<10%,  Figure  2b).  While  the  amount  of  catenane  3 
was  usually  improved  through  the  use  of  biased  component 
mixtures  (i.e.,  1:2  =  3:l),  the  yield  of  4  was  typically  not 
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Figure  2.  (a)  HPLC-UV  trace  (289  nm)  at  day  7  of  a  DSA  reaction 
with  D-la  and  l,l-h2  (1:1,  5  mM  in  MeCN/CHCl3  =  1:3,  50  equiv  of 
TFA).  (b)  HPLC-UV  trace  (289  nm)  at  day  7  of  a  DSA  reaction  with 
D-lb  and  l,l-f2  (1:1,  5  mM  in  MeCN/CHCl3  =  1:3,  50  equiv  of  TFA). 
Inset:  [2]-catenane  region. 

sensitive  to  the  feed  ratio.  The  choice  of  the  reaction  solvent, 
however,  strongly  affects  the  [2]-catenane  speciation,  with  pure 
chloroform  giving  3  exclusively  while  25%  MeCN/CHCl3 
maximized  the  amount  of  4.  Using  these  conditions,  a  series 
of  [2]-catenanes  were  self-assembled  and  isolated  after 
neutralizing  the  reaction  solution  to  halt  hydrazone  exchange; 
additional  purification  was  achieved  either  by  flash  column 
chromatography  or  by  semi-preparative  HPLC.  Slow  concen¬ 
tration  of  these  products  from  MeOH/ CHC13  or  pyridine  led 
directly  to  X-ray-quality  crystals. 

For  the  purpose  of  this  and  the  following  paper,  3 
corresponds  to  the  C2-symmetric  [2]-catenane  that  is  made 
up  of  two  interlocked  macrocycles  assembled  from  three  1 
monomers  and  one  2  monomer.  The  Ph  superscript 
designation  refers  to  the  phenyl  glycine  version  of  2  (cf. 
naphthylglycine),  while  the  x  and  y  designations  correspond  to 
the  D-monomer  and  naphthylglycine  variants  described  in 
Scheme  1,  respectively. 

2.  Structural  Analysis  of  [2]-Catenanes  Having  Two 
Identical  Rings  (y3x).  2.1.  Solid-State  Analysis  (X-ray). 
Previously  we  communicated  the  high-resolution  solid-state 
structural  features  of  H3a.7  Its  most  striking  feature  was  its 
compact  structure  and  interlocked  C2-symmetric  nature  (Figure 
3).  The  structure  can  be  loosely  described  as  flower-like  with  a 
dense  core  containing  the  two  unique  L-proline-L-naphthyl 
glycine  (L-Pro-L-NaphGly)  units,  from  which  radiate  four  loops 
that  terminate  in  (3- turns.  One  loop  from  each  macrocycle  is 
splayed  out  in  a  petal-like  arrangement,  while  the  second  is 
tightly  associated  with  its  C2-related  pair  in  a  pillar-like  central 
projection  (Figure  3a). 

Like  the  petal-loops,  the  pillar-like  loops  terminate  in  a 
proline-induced  (3- turn.  Each  of  the  individual  cyclic  tetramers 
adopts  a  roughly  saddle-shaped  conformation,  with  each  of  the 
four  Pro-X  sequences  adopting  a  /Tturn  structure  that  reverses 
the  direction  of  the  chain.  The  fold  of  the  tetramers  is  such  that 
the  /?-turns  create  a  clip-like  arrangement  that  positions  a 
proline  and  an  Aib  methyl  above  and  below  the  naphthyl  ring 


(a)  Petal-like  Pillar-like  paired  (b)  L-Pro-X 


Figure  3.  (a)  X-ray  structure  of  H3a  displaying  two  identical 
interlocked  tetramers.  (b)  View  from  the  bottom  of  (a)  along  the 
C2  axis  showing  a  dense  core.  All  hydrogen  atoms  are  omitted  for 
clarity. 


of  the  other  macrocycle.  The  metrical  parameters  of  these  units 
suggests  a  nearly  perfect  arrangement  for  a  beneficial  CH —n 
interaction.16 

Similar  features  were  observed  in  a  series  of  new  [2]  - 
catenanes  (F3b,  ho3c,  F3dph,  Figure  4)  self-assembled  from 
structurally  modified  monomer  units  (Scheme  l).  Each  is 
flower-like  with  a  dense  core  containing  the  aryl  glycine  units 
sandwiched  between  L-proline  and  a  variable  unit  that  includes 
1-aminocyclobutanecarboxylic  acid  (Ac4c),  1-aminocyclohex- 
anecarboxylic  acid  (Ac6c),  and  L-(4-chlorophenyl) glycine  (l-4- 
Cl-PhGly).  Like  H3a,  each  tetrameric  ring  is  saddle-shaped, 
with  four  (3- turns  connected  by  a  planar,  extended  1,4- 
hydrazone/benzamide  linkage. 

2.1.1.  /3-Turn  Characterization.  While  proline  is  well  known 
to  nucleate  turn  structures  in  peptides  and  proteins,10  the 
structure  adopted  by  the  Pro-X  segments  (X  =  variable  unit)  in 
these  small-molecule  receptors  has  not  previously  been 
established.  In  each  of  the  structurally  characterized  catenanes, 
however,  it  is  clear  from  metrical  parameters  (O,  'P  analysis,  SI1 
Table  l)  that  the  Pro-X  segment  adopts  a  standard  suite  of  (3- 
turns.  For  the  purpose  of  this  discussion,  the  /Tturns  are 
identified  as  (3-1— (3-4,  with  / 3-2  (D-Pro-X  unit)  and  (3-4  (L-Pro-L- 
arylGly  unit)  making  up  the  core,  and  (3- 1  and  (3- 3  (D-Pro-X 
units)  creating  the  turns  at  the  periphery  of  the  central  pillar¬ 
like  and  petal-like  structures,  respectively  (Figure  4).  In  general 
(3-2  and  (3-4  were  invariant,  adopting  a  type  IT  and  type  VIII 
turn,  respectively.17  Depending  on  the  structural  modifications, 
the  peripheral  (3-1  and  (3- 3  turns  were  more  variable,  with  type 
IT  or  type  T  being  accessible.17 

A  comparative  analysis  of  the  /Tturn  metrical  parameters  of 
the  four  structurally  characterized  catenanes  was  informative 
(Table  l).  Most  striking  was  the  invariability  of  the  aryl  glycine 
unit  in  the  central  core  ((3-4),  as  almost  no  dihedral  angle 
differences  (<9°)  were  noted  across  the  series  of  structures. 
The  D-Pro-X  unit,  which  provides  the  second  half  of  the 
molecular  clip  in  the  core  ((3-2),  was  also  tightly  constrained, 
though  the  L-4-Cl-PhGly  case  (ho3c),  which  positions  the  4-C1- 
phenyl  unit  below  the  naphthyl  core  and  is  non-C2-symmetric, 
displayed  larger  deviations.  By  contrast,  the  peripheral  /Tturns 
((3-1,  (3- 3)  were  considerably  more  flexible,  and  were  responsive 
to  changes  in  the  D-Pro-X  unit.  For  example,  the  Ac6c  group  in 
F3b  generated  high  torsional  angle  deviation  in  (3- 3,  while  the 
Ac4c  group  in  F3dph  and  the  L-4-Cl-PhGly  group  in  ho3c 
caused  deviations  in  / 3-3  and,  more  noticeably,  in  (3-1  where  the 
two  loops  interact.  This  analysis  shows  the  turn  structure  in  the 
central  core,  i.e.,  the  point  where  the  CH— n—  CH  “clip” 
interactions  occur,  to  be  highly  conserved  across  the  series  of 
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Figure  4.  X-ray  structure  representations  with  one  ring  displayed  as  space  filling  models  and  the  second  interlocked  ring  as  stick  figures,  (a)  Side 
view  of  F3b  and  (b)  bottom  view  looking  up  onto  the  CH— n—  CH  sandwiches,  (c)  Side  view  of  F3dph  and  (d)  top  view  of  (c)  highlighting  the  two 
different  /?-l  turns  (type  I'  for  ring  1  and  type  II'  for  ring  2).  (e)  Side  view  of  ho3c  showing  two  interlocked  tetramers  and  (f)  bottom  view  of  (e) 
highlighting  the  CH— n  interactions  between  L-NaphGly  and  L-HOPro  residues,  and  the  two  different  n—n  stacking  interactions  between  L-NaphGly 
and  L-4-Cl-PhGly  moieties.  All  hydrogen  atoms  except  those  participating  in  CH— n—  CH  and  CH— n—  n  sandwiches  are  omitted  for  clarity. 


Table  1.  Direct  Comparison  of  Metrical  /TTurn  Parameters  of  H3a,  F3b,  F3dph,  and  ho3c 

AF3dph  -  H3a  (°)a  Aho3c  -  H3a  (°)a 


dihedral  angle  (deg)  of  H3a 

AF3b  -  H3a  ( °)a 

ring  1 

ring  2 

ring  1 

ring  2 

P-1  (loop) 

«Vi 

60.1 

-4.6 

-4.1 

2.7 

1.3 

-5.6 

-145.9 

3.4 

196.9 

12.6 

6.4 

12.9 

-52.3 

0.1 

147.3 

-24.7 

-20.1 

-10.4 

'Vlt2 

-35.9 

-2.2 

-2.1 

1S.9 

25.2 

7.5 

P-2  (core) 

<tv, 

58.7 

6.0 

0.7 

3.4 

-0.7 

7.1 

-128.9 

-3.2 

-0.4 

-0.4 

-5.6 

7.6 

®i+2 

-62.7 

4.7 

-3.6 

-2.4 

-17.9 

-6.7 

-13.0 

-12.6 

-4.0 

-6.0 

20.2 

1.6 

P- 3  (loop) 

«Vl 

58.5 

-6.0 

-5.5 

-0.5 

2.2 

3.3 

-126.6 

-11.2 

-1.8 

-1.4 

-6.0 

-8.9 

®i+2 

-62.5 

-6.6 

-6.5 

-9.5 

-10.7 

-5.8 

-21.9 

17.6 

7.9 

1S.9 

7.1 

6.6 

P-4  (core) 

3>i+, 

-66.1 

1.2 

-3.4 

-5.3 

-5.8 

-6.0 

V 

-30.7 

-3.2 

-1.0 

-1.8 

-4.8 

0.8 

®i+2 

-159.0 

-3.5 

-1.4 

0.9 

-1.1 

-1.7 

83.0 

-0.2 

-6.8 

-8.6 

3.2 

-4.2 

aThe  corresponding  dihedral  angle  differences  in  H3a  vs  F3b,  F3dph,  and  ho3c.  Dihedral  angle  differences  of  >10°  are  in  bold  italics.  The  rmsd  metric 
of  structural  similarity  to  the  parent  structure  (H3a)  obtained  by  PyMOL  pair  fitting  using  48  arbitrarily  chosen  backbone  atom  pairs  for  F3b  vs  H3a, 
F3dph  vs  H3a,  and  ho3c  vs  H3a  are  0.246,  0.791,  and  1.067  A,  respectively.  See  details  in  SI1  Table  2. 


[2]-catenanes.  The  tight  packing  of  this  region  reinforces  the 
notion  of  a  highly  optimized  three-dimensional  structural  motif 
which  provides  significant  structural  stabilization. 

2.1.2.  Hydrogen-Bonding  Interactions.18  In  the  peripheral 
turns  (/?- 1  and  /?-3)  of  the  [2]-catenane,  the  expected  j/j+3 
intra-turn  hydrogen  bonds  (HBs)  are  observed  though  there 
are  no  inter-macro  cycle  HBs  (SI1  Table  3).  In  contrast,  the  /?- 
turns  in  the  core  are  distorted  to  enable  inter-ring  HBs  for  fi-2 
and  /?-4,  with  only  /?-2  containing  a  typical  // i+ 3  HB;  /?-4  has  its 


i+ 3  residue  engaged  in  a  conserved  inter-ring  HB.  The  series  is, 
therefore,  characterized  by  a  total  of  six  intra-turn  hydrogen 
bonds  for  fi-l,  fi-2,  and  /?-3,  and  six  inter-ring  hydrogen  bonds 
(/?-2(N)— >/?-4'(0),  p- 4(N)^-2'(0),  /?-2'(N)^/?-4(0),  p 
4'(N)->/?-2(0),  /?-4(N)^/?-4’(0),  and  /?-4'(N)-^-4 (O)).16 
Except  for  catenane  F3dph,  the  O, — N,+3  distances  for  inter-ring 
hydrogen  bonds  only  marginally  deviate  from  the  parent 
catenane  H3a  (<0.063  A)  (SI1  Table  4),  indicating  that  the 
skeleton  of  the  [2] -catenane  is  conserved  across  the  series.  As 
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Table  2.  Direct  Comparison  of  the  Distances  (A)  Characterizing  CH— n  or  k—k  Stacking  Interactions  in  H3a,  F3b,  F3dph,  and 
ho3c 


(b) 


X-ray  structural  representations  of 
H3a  showing  the  distances  (a)  be¬ 
tween  three  carbons  of  L-Pro  CHs 
(C3H,  C4H  and  C5H)  and  an  imagi¬ 
nary  naphthyl  plane  of  the  L- 
NaphGly  unit  and  (b)  between  the 
naphthyl  plane  and  an  Aib  methyl 
group  carbon  atom. 


AF3dph  - 

H3a  ( A)b’c 

Aho3c 

-  H3a  (A)b 

H3a  (A)a 

AF3b  -  H3a  (A)b 

ring  1 

ring  2 

ring  1 

ring  2 

L-proline  and  aryl 

L-Pro  C3 

3.999 

-0.291 

-0.297 

-0.260 

-0.215 

-0.315 

L-Pro  C4 

3.922 

-0.311 

-0.289 

-0.450 

-0.144 

-0.374 

L-Pro  C5 

3.916 

-0.163 

-0.074 

-0.317 

-0.136 

-0.310 

X  and  aryl 

ii 

£ 

X  =  Ac6c 

X  = 

Ac4c 

X  =  L- 

4-Cl-PhGly 

3.425 

0.089 

0.629d 

0.133 

0.167 

0.120 

0.078 

aThe  distances  between  an  imaginary  L-NaphGly  plane  and  the  corresponding  carbons  of  three  L-Pro  CH’s,  and  an  Aib  methyl  carbon  and  the  l- 
NaphGly  plane  in  H3a.  ^The  differences  in  the  corresponding  distances  indicating  CH— n  or  Jt—n  stacking  interactions  in  H3a  vs  F3b,  F3dph;  and 
ho3c.  The  negative  values  represent  the  shorter  distance  compared  to  those  in  H3a,  implying  stronger  interactions.  For  the  ring  assignment;  see 
Figure  4d,f.  cAn  imaginary  phenyl  plane  of  the  L-PhGly  unit  was  used  for  computing  the  distances  in  F3dph.  ^Following  the  distance  cutoff  (X---M  < 
4.3  A  [Steiner;  T.;  Koellner;  G.  /.  Mol.  Biol.  2001 ,  305;  535—557];  the  distance  between  the  L-NaphGly  plane  and  the  carbon  of  the  second  CH2 
group  (C2H2)  in  Ac6c  moiety  was  included  (see  SI1  Table  5)  and  compared  with  the  distance  between  an  Aib  methyl  carbon  and  the  NaphGly  plane 


will  be  discussed  in  more  depth  in  the  following  paper/4  the 
Oj — N/+3  distances  for  two  inter-ring  hydrogen  bonds  in  F3dph 
(y 9-2(N)->/M-'(0 )  and  /?-2'(N)->/?-4 (O))  are  shorter  than 
those  in  the  rest  of  the  catenanes.  The  trigger  for  the  shortening 
of  these  key  inter-ring  HBs  is  the  replacement  of  a  naphthyl 
group  for  a  phenyl  group;  which  initiates  a  series  of  changes  to 
the  network  of  non-covalent  interactions. 

2.1.3.  CH-n  Interactions  in  the  Core.  CH -n  interactions 
are  well-known  stabilizing  molecular  interactions.19,20  Since  an 
aryl  group  in  the  l;l-2  and  2ph  monomers  is  required  for 
catenane  formation  and  the  catenane  adopts  a  fold  that 
sandwiches  this  arene  between  an  L-proline  unit  and  the  X- 
moiety  of  the  other  macrocycle;  CH— n  interactions  are 
reasonably  inferred  as  an  important;  if  not  critical;  stabilizing 
interaction.  Metrical  parameters  from  H3a  indicate  a  con¬ 
strained  geometry  that  maximizes  the  CH— n  interaction 
between  naphthyl  and  L-Pro  on  one  side  (3.9— 4.0  A)  and 
Aib  on  the  opposite  (3.4  A)  (Table  2).  Within  the  new  series  of 
catenanes;  F3dph  and  F3b  follow  a  similar  CH— n  arrangement; 
with  a  geminally  substituted  cyclohexane  replacing  the  Aib 
methyl  group  of  H3a  below  the  naphthyl  ring  in  F3b;  and  a 
cyclobutyl  group  replacing  it  in  F3dph  (see  SI1  Table  5  for  key 
metrical  parameters). 

In  the  case  of  catenane  ho3C;  the  naphthyl  ring  is  pinched 
between  L-Pro  and  a  4-Cl-phenyl  residue;  i.e.;  a  CH— n—  n 
molecular  clip.  The  non-C2-symmetric  nature  of  ho3c  is 
expressed  in  the  differential  n—n  interaction  between  naphthyl 
and  the  4-Cl-phenyl  unit;  one  being  biased  toward  a  nearly 
eclipsed  face-to-face  interaction;  while  the  other  is  better 
described  as  intermediate  between  an  edge-to-face  and  face-to- 


face  (~33°,  SI1  Table  5).  NMR  experiments  (vide  infra) 
indicate  that  the  complex  is  fhmonal,  as  only  a  single  C2- 
symmetric  structure  is  observed  in  XH  NMR  spectra. 

A  comparison  of  the  metrical  parameters  for  these  catenanes 
to  H3a  was  informative  (Table  2).  In  F3b  and  H°3c;  shorter 
distances  between  the  naphthyl  group  and  the  F-  or  OH- 
substituted  proline  moieties  agreed  with  the  expected  enhance¬ 
ment  of  the  CH-^s  electrostatic  component;  which  would  be 
stabilized  by  a  more  positively  charged  CH  interacting  with  the 
arene’s  quadrupole.19  Despite  the  weaker  n— donor  character¬ 
istics  of  Ph  versus  naphthyl/1  the  decrease  in  the  CH— n 
distances  of  F3dph  compared  to  H3a  coupled  with  the  stronger 
thermodynamic  stability  of  F  vs  non-F  catenanes  (see  our  next 
paper14)  reinforce  the  argument  that  the  CH— n  interaction  can 
be  electronically  manipulated;  and  that  the  more  acidic  CH 
bonds  of  F-substituent  prolines  enhance  the  strength.  Zondlo 

has  previously  noted  this  effect  in  short  peptide  model  systems 
.  19 

in  water. 

On  the  opposite  face  of  the  arene  clip;  the  interactions 
between  the  aryl  group  and  Ac4C;  Ac6c;  and  4-Cl-phenyl 
residues  (F3dph;  F3b  and  ho3c)  were  variable.  The  combination 
of  a  naphthyl  group  and  the  Ac6c  moiety  was  structurally  similar 
to  H3a;  except  that  the  conformationally  restricted  Ac6c  group 
(cf.  CH3)  orients  a  single  CH  into  the  aromatic  ring;  which 
shifts  this  H  resonance  to  higher  field  than  any  other  CH— n 
group  in  the  XH  NMR  spectrum;  vide  infra.  The  longer  distance 
between  the  Ph  centroid  and  the  Ac4c  group  in  F3dph  (cf.  H3a; 
Table  2)  indicates  that  the  weaker  n— donor  character  of  a  Ph 
group  causes  a  lengthening  of  the  CH— n  distances;  implying 
that  the  core  might  be  less  tightly  packed.  In  the  same  way; 

dx.doi.org/10.1021/ja302345n  I J.  Am.  Chem.  Soc.  2012,  134,  11415-11429 
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catenane  ho3c  displays  lengthened  n—n  interactions  in  one  of 
its  two  rings,  suggesting  that  the  optimum  CH —n—n 
arrangement  cannot  be  simultaneously  achieved  in  both  halves 
of  the  molecule. 

2.1.4.  Aryl  Barrel.  The  gross  features  of  the  catenanes  are 
established  by  the  connection  of  the  peripheral  /Tturn  loops  to 
the  dense  core.  This  connection  is  achieved  through  1,4- 
hydrazone/benzamide  linkages  that  create  a  barrel-like 
structure.  The  aromatic  rings  interact  with  one  another  through 
a  combination  of  slipped  face-to-face  and  edge-to-face  arrange¬ 
ments  (Figure  S). 20,22  Of  the  eight  aromatic  rings  making  up 


Figure  5.  X-ray  structural  representations  of  F3dph,  emphasizing  how 
the  1,4-hydrazone/benzamide  linkages  establish  an  aryl  barrel  through 
n—n  stacking  interactions  (face-to-face  and  edge-to-face).  The  A,  B,  C, 
and  D  linkages  are  directly  connected  to  the  core  L-NaphGly  (/?-4),  d- 
Pro  (/?- 2),  Ac4c  (/?-2),  and  L-Pro  (/?-4)  residues,  respectively. 


this  barrel,  a  number  of  the  centroid-to-centroid  distances  were 
illuminating.  For  catenanes  H3a,  F3b,  and  ho3c,  each  of  which 
has  a  type  IT  /?-l  turn,  the  A  and  A'  aryl  rings  (Figure  5)  occupy 
positions  at  the  interface  of  the  interlocked  region.  The  arene 
rings  interact  through  a  slipped  face-to-face  orientation  with  a 
nearly  invariant  plane-to-plane  distance  of  3.86  A.  A  similar 
geometry  but  significantly  shorter  distance  (by  ~0.2  A)  was 
noted  for  F3dph,  which  adopts  a  single  type  V  /?-l  turn  (F3dph) 
(Table  3  and  SI1  Table  6).  The  combination  of  a  type  IT  and  a 
type  I'  in  the  two  interacting  fi-l  turns  establishes  an  H-bond 
between  the  proline  amide  carbonyl  in  ring  2  and  the  amide 
NH  of  the  Ac4c  in  ring  1  (N — O  distance  =  3.175  A).23  It  thus 
seems  reasonable  that  weaker  CH —n  interactions  in  the  core 
(i.e.,  Ph  vs  naphthyl)21  create  a  more  flexible  core  that  can 
compensate  by  optimizing  aromatic  interactions  in  the  barrel 
region. 


Other  centroid  distances  informed  on  the  width  of  the  barrel 
and  hinted  at  potential  for  compensating  effects  in  this  region. 
For  example,  compared  to  H3a's  barrel,  those  of  F3dph  and 
ho3c  are  smaller;  in  particular,  the  A-to-C  centroid  distances 
are  considerably  shorter,  implying  that  these  structures  benefit 
from  greater  aromatic  stabilization.  The  barrel  size,  as  estimated 
by  the  product  of  the  A— C  and  B— D  distances  support  such  a 
compression  in  F3dph  and  ho3c. 

2.2.  NMR  Analysis.  2.2.1.  CH-n  Interactions.  The  C2- 
symmetric  structure  of  catenane  H3a  was  found  to  persist  in 
solution,  and  despite  its  size,  high-resolution  spectra  could  be 
obtained  in  pyridine-d5,  which  acted  to  spread  the  chemical 
shifts  of  the  lH  NMR  resonances.7  Close  contacts  between  a 
single  methyl  group  of  Aib  and  a  proline  residue  to  the 
naphthyl  ring  in  the  clip  was  noted  by  their  upfield-shifted 
resonances,  and  appropriate  cross-peaks  in  the  ROESY 
spectrum.7  Catenane  F3b  also  provided  a  sharp  spectrum 
with  well-dispersed  signals  that  corresponded  to  a  C2- 
symmetric  structure  (6—8%  MeOH-d4/CDCl3)  (Figure  6a). 
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Figure  6.  500.1  MHz  lU  NMR  spectra  at  20  °C  of  (a)  F3b  in  7% 

MeOH-d4/CDCl3;  (b)  F3dph  in  pyridine-d5;  (c)  ho3c  in  pyridine-d5. 


Table  3.  Direct  Comparison  of  the  Centroid-to-Centroid  Distances  (A)  in  the  Aryl  Barrel  of  H3a,  F3b,  F3dph,  and  ho3c 


AF3dph  -  H3a  (A)*  Aho3c  -  H3a  (A)b 


H3a  (A)a 

AF3b  -  H3a  (. k)b 

ring  1 

ring  2 

ring  1 

ring  2 

A  to  A'  (C2-related  ring) 

3.855 

0.010 

-0.196 

-0.196 

-0.044 

-0.044 

A  to  C  (intra-ring) 

8.938 

0.028 

-0.813 

-0.775 

-1.162 

-0.943 

B  to  D  (intra-ring) 

9.121 

0.249 

0.298 

0.234 

0.683 

0.289 

barrel  size  (A2 * *) 

81.523c 

2.488d 

-4.994d 

— 4.971 d 

—5.287^ 

— 6.290‘ 

aDistances  (A)  between  imaginary  centroids  computed  for  each  aryl  linkage  in  the  barrel  as  identified  in  Figure  5.  ^Differential  distances  (A) 
describing  both  close  contacts  and  barrel  widths.  Two  values  are  reported  for  F3dph  and  ho3c,  as  these  compounds  are  non-C2-symmetric  in  the  solid 
state.  Negative  values  described  shorter  distances.  cBarrel  size  is  simply  obtained  from  a  multiplication  of  two  barrel  distances,  i.e.,  (distance  of  A  to 
C)  X  (distance  of  B  to  D).  ^Differential  barrel  size  (A2).  Negative  values  described  shrunken  barrel  areas. 
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The  F-substituted  L-proline  residues  provided  a  useful  handle 
for  structural  analysis  as  the  geminal  H*  participates  in  the 
CH —n  interactions  of  the  clip  (Figure  7).  The  H*  was  assigned 
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Figure  7.  Portion  (<5  =  —0.5  to  5.0  ppm)  of  400.0  MHz  XH  NMR 
spectra  of  F3b  at  20  °C  in  7%  MeOH-d4/CDCl3:  (a)  19F  coupled  and 
(b)  19F  decoupled. 


to  a  doublet  at  4.0  ppm  (JHF  =  53.1  Hz),  upheld  shifted  from 
the  monomer  (8  =  5.1  ppm,  JHF  =  52.4  Hz).  This  assignment 
was  confirmed  by  comparison  to  XH  NMR  spectra  obtained 
with  19F  decoupling  (Figure  7  and  SI1  Figure  5).  Cross-peaks 
indicative  of  CH— n  interactions  between  the  L-FPro  CH's  and 
the  naphthyl  group  were  also  observed  in  the  ROESY  spectrum 
(Figure  8). 24  Consistent  with  its  solid-state  structure,  three 
L-FPro  CH  cross-peaks  to  the  naphthyl  group  were  noted  (see 
inset  Figure  8).  The  ROESY  spectrum  of  F3b  also  showed  that 
one  cyclohexyl  hydrogen  (8  =  —0.2  ppm)  was  significantly 
shielded  by  its  close  interaction  with  the  opposite  face  of  the 
arene  (vide  supra).  Unlike  the  Aib  case,  the  CH  group 
participating  in  the  C— H— n  interaction  does  not  exchange  with 
a  non-interacting  position  (CH3  rotation)  and  is  thus  more 
significantly  shifted  by  its  aromatic  interaction.  The  obvious 
near-neighbor  contacts  from  the  solid-state  structures  are 
clearly  maintained  in  solution. 

The  remaining  catenanes,  F3dph  and  ho3c,  provided  high- 
resolution  XH  NMR  spectra  in  pyridine-d5  solvent  (Figure 
6b, c).  Contrasting  their  non-C2-symmetric  solid-state  struc¬ 
tures,  however,  these  spectra  were  symmetric,  suggesting  time- 
averaged  dynamic  behavior.  In  the  case  of  F3dph,  this  would 
require  interconversion  of  either  the  fi-l  type  F  into  a  type  IF 
turn  or  the  isomerization  of  the  other  loop's  type  IF  turn  into  a 
type  F  to  regenerate  C2  symmetry.  A  more  subtle  slipped  n/n  to 
edged  n/n  exchange  is  required  for  macrocycle  equilibration  in 
ho3c. 

Despite  representing  time-averaged  spectra,  the  proline  CH 
units  participating  in  the  CH— n  interactions  of  both  catenanes 
are  diagnostic  and  upfield-shifted  in  their  respective  ROESY 
spectra  (SI1  Figures  11a  and  13a).  The  CH— n  interacting  CH2 
unit  of  the  Ac4c  residue  in  F3dph  are  also  upfield-shifted,  and 
cross-peaks  to  the  Naph  group  in  its  ROESY  spectrum  are 
observed.  Unlike  the  Ac6c  case,  which  orients  the  equatorial 
CH  to  the  arene,  the  Ac4c  analogue  projects  both  geminal  CH's 


Figure  8.  Portion  of  the  2D  600.1  MHz  ROESY  spectrum  of  F3b  (FI 
8  =  —0.5  to  5.0  ppm;  ¥2  8  =  6.5— 8.5  ppm;  7%  MeOH-d4/CDCl3,  20 
°C).  Inset:  solid-state  structure  showing  CH— n  interactions  of  a  l- 
proline  residue  and  an  Ac6c  group  to  the  naphthyl  group. 


toward  the  arene,  and  both  thus  exhibit  upheld  shifting  and 
cross-peaks  to  the  Naph  in  the  ROESY  spectrum. 

Despite  the  degree  of  spectral  overlap  in  the  aromatic  region 
of  ho3c,  the  4-Cl-phenyl  residue  participating  in  the  n—n  stack 
could  be  identified  on  the  basis  of  a  combination  of  TOCSY 
and  ROESY  spectra  and  its  solid-state  structure  (SI1  Figures 
12—14).  Indicative  were  upfield-shifted  aryl  resonances  at  <5  = 
6.74  and  6.80  ppm  (the  remaining  aryl  signals  resonate  from  8 
-  7.05  to  7.92  ppm),  which  additionally  showed  cross  peaks  to 
the  naphthyl  group  (SI1  Figure  14). 

In  addition  to  providing  information  relevant  to  dynamics  in 
the  periphery  of  the  catenane,  NMR  analysis  also  revealed  that 
some  rotational  motion  is  possible  in  the  core  of  the  3 
structure.  Since  a  full  assignment  of  the  XH  NMR  spectrum  of 
F3dph  was  achieved  (ROESY,  COSY,  TOCSY,  and  HSQC,  SI1 
Figures  9—11),  these  data  combined  with  spectral  integration 
indicated  that  rotation  of  the  templating  Ph— CH  moiety  was 
rapid  on  the  NMR  time  scale  as  the  chemically  inequivalent 
ortho/ortho '  and  meta/meta'  positions  were  time  averaged 
(Figure  9,  inset).  Thus,  despite  the  apparent  high  density,  the 
near  invariance  of  the  core's  structure,  and  the  tightness  of  the 
CH— n—  CH  sandwich,  solution  dynamics  can  enable  even 
highly  congested  groups  to  rotate  on  the  NMR  scale. 

2.2.2.  n—n  Interactions  in  the  Aromatic  Barrel.  In  the  solid- 
state  structure  of  each  catenane,  the  1,4-hydrazone/benzamide 
linkages  assemble  through  a  combination  of  slipped  face-to-face 
and  edge-to-face  n— ;r-stackin|  interactions  into  a  tightly  packed 
aromatic  barrel  (Figure  5).  '  These  close  contacts  were 
investigated  spectroscopically  in  F3dph  due  to  the  simplified 
nature  of  its  aromatic  region.  Spectra  in  pyridine-d5  generated 
well-dispersed  amide  NH's,  imine  CH=N,  and  aromatic 
signals.  The  COSY  and  TOCSY  spectra  enabled  the  aryl  barrel 
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Figure  9.  Portion  of  the  2D  500.1  MHz  ROESY  spectrum  of  F3dph 
(FI  8  =  5.9— 9.5  ppm;  F2  8  =  5.9— 9.5  ppm;  pyridine-d5,  20  °C). 
Ar^:  2,  Ar#12,  and  ArAx  2  represent  the  remaining  aryl  ring  CH’s  (B, 
C,  and  D).  Inset:  solid-state  structure  indicating  ArOi)2,  neighboring 
imine  CH  groups  in  the  ring  A'  and  C  (A— CH=N,  and  C— CH=N). 


CH’s  and  the  phenyl  CH’s  in  the  core  to  be  distinguished  (SI1 
Figures  9  and  10),  while  the  HSQC  spectrum  differentiated  the 
amide  NH’s  from  the  imine  CH’s  by  the  latter  s  association  to 
an  imine  carbon  (N=CH)  (SI1  Figure  lib).  In  addition  to 
signals  in  the  expected  spectral  region,  two  resonances  were 
consistently  found  upheld  shifted  in  the  *H  NMR  spectra  ( 8  = 
5.9— 6.5  ppm). 

Key  to  their  deconvolution  was  the  strong  through-space 
correlation  between  the  most  upheld  of  these  CH’s  (Ar^x 
(2H),  8  =  6. 1  ppm)  and  a  neighboring  imine  CH  group  (A- 
CH=N  (1H),  8  =  6.25  ppm)  (ROESY,  blue  boxes  in  Figure 
9),  which  overlaps  with  CHa  in  this  compound.  In  the  Figure  9 
inset,  these  correspond  to  the  circled  hydrogens,  one  of  which 
is  being  viewed  through  aromatic  ring  B  (hence  the  upheld 
shift).  Each  of  these  resonances  could  then  be  correlated  to  a 
network  of  three  symmetry-inequivalent  sets  of  aryl  CH’s 
(black  boxes  in  Figure  9),  thus  establishing  their  close  contact 
and  the  spatial  proximity  of  the  remaining  1,4-substituted  rings 
(B,  C,  and  D).  Additionally  noted  was  a  strong  correlation 
(green  boxes  in  Figure  9)  between  two  imine  CH’s  (A  —CH= 
N  and  C— CH=N  in  Figure  9  inset),  consistent  with  one 
particularly  close  contact  in  the  solid-state  structure  (3.4  A). 
The  high  degree  of  structural  order  as  measured  by  the 
preciseness  of  inter-aromatic  contacts,  and  homology,  as 
assessed  by  the  degree  to  which  key  ROESY  cross-peaks 
repeat  across  the  series  of  catenanes,  imply  that  the  aryl  barrel 


structure  is  conserved  across  the  series  and  is  thus  an  important 
contributor  to  catenane  stability. 

2.2.3.  H/D  Exchange.  The  four  hydrazone  NH’s  ( 8  =  10.7— 
11.3  ppm)  and  the  four  amide  NH’s  of  H3a  resonate  in  the 
downfield  portion  of  the  *H  NMR  spectra  (pyridine-d5,  20 
°C).7  The  amide  signals  were  differentiated  from  the  imine  CH 
signals  in  the  HSQC  spectrum  by  the  latter  s  correlation  to 
their  imine  carbons.  The  (3-4  amide  NH  was  specifically 
identified  by  its  COSY  cross  peak  to  the  adjacent  CHa  of  the 
aryl  glycine.  Since  these  NH  groups  (except  for  the  (3-1  and  (3-3 
amide  NTs)  participate  in  intra-  and  inter-macrocyclic 
hydrogen-bonding  interactions  (vide  supra),  H/D  exchange 
reactions  were  utilized  to  investigate  the  inherent  structural 
flexibility  of  the  catenane  structure. 

In  pyridine-d5/D20,  all  NH  groups  of  H3a  exchanged  within 
15  h  with  the  exception  of  the  amide  NTs  of  the  (3-2  and  (3-4 
turns  (Figure  10);  these  latter  amide  NH’s  were  unchanged 


Figure  10.  Portion  ( 8  =  5.8—11.8  ppm)  of  the  599.8  MHz  XH  NMR 
spectra  of  H3a  (pyridine-d5  (0.6  mL)  +  D20  (0.03  mL),  20  °C), 
recorded  at  different  time  from  mixing. 


even  after  12  days.  Since  they  are  positioned  near  the  Aib  and  l- 
Pro  units  participating  in  the  CH —n  interactions  and 
additionally  form  inter-ring  HBs  (solid  state),  these  data 
imply  that  the  core  created  by  the  (3-2  and  (3-4  turns  is  densely 
packed  and  relatively  inaccessible.  In  contrast,  the  loops 
comprised  of  the  (3-1  and  (3-3  turns  are  relatively  flexible  and 
solvent  accessible  according  to  H/D  exchange. 

Catenane  H°3c,  which  adopts  a  non-symmetric  solid-state 
structure  but  a  C2-averaged  XH  NMR  spectrum,  was  subjected 
to  similar  H/D  exchange  experiments  to  determine  if  this 
phenomenon  was  indicative  of  a  more  dynamic  structure;  if  the 
non-symmetric  solid-state  structure  was  simply  a  crystal 
packing  effect,  then  an  enhanced  H/D  exchange  rate  was  not 
expected.  Additionally  hinting  at  a  “looser”  arrangement  were 
the  longer  CH— n  and  n— n  distances  between  the  NaphGly  and 
the  L-4-Cl-PhGly  units.  In  this  case,  all  NH  groups  exchanged 
within  2.6  days,  and  the  adjacent  CHa  transformed  from  a 
doublet  into  a  broad  singlet  lacking  the  normally  observed 
HC— NH  vicinal  coupling  (Figure  11a).  The  contrasting 
exchange  of  the  core  amide  NH’s  {(3-2  and  (3-4),  in  particular, 
indicates  that  the  core  is  less  tightly  packed  in  solution  and 
more  accessible  than  the  Aib  variant.  Since  the  intermacrocyclic 
HB  lengths  in  the  core  are  similar  to  H3a,  this  higher  reactivity 
appears  to  be  driven  by  a  weaker  CH— n— n  clip  as  compared  to 
the  CH— n—  CH  clip.  Alternatively,  the  dynamics  that 
accompany  a  structural  rocking  between  the  parallel  and 
edge-to-face  aromatic  orientations  might  enhance  the  cate- 
nane’s  accessibility  to  H/D  exchange  (Figure  4e,f). 

For  catenane  F3dph,  the  unique  (3-1  turn  of  one  ring  and  the 
longer  CH— n  distances  coupled  with  the  symmetric  solution 
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Figure  11.  (a)  Portion  ( S  =  5.8-12.5  ppm)  of  the  500.1  MHz  lU 
NMR  spectra  of  ho3c  (pyridine-d5  (0.6  mL)  +  D20  (0.03  mL),  20 
°C),  recorded  at  different  time  from  mixing.  The  bottom  spectrum  is 
the  same  portion  of  ho3c  in  pyridine-d5  before  adding  D20.  (b) 
Portion  (S  =  5.6-12.0  ppm)  of  the  500.1  MHz  lU  NMR  spectra  of 
F3dph  (pyridine-d5  (0.6  mL)  +  D20  (0.03  mL),  20  °C),  recorded  at 
different  time  from  mixing.  The  bottom  spectrum  is  the  same  portion 
of  F3dph  in  pyridine-d5  before  adding  D20. 


structure  suggested  a  compound  that  was  more  flexible  and 
dynamic  than  the  parent  H3a.  In  this  case,  all  hydrazone  NH 
groups  exchanged  within  2  h  in  pyridine-ds/D20  and  the  H/D 
exchange  of  all  amide  NH  groups  was  complete  within  12  days 
(Figure  lib).  The  rates  of  H/D  exchange  rates  therefore  follow 
the  trend  of  ho3c  >  F3dph  >  H3a.  For  the  catenane  F3b,  its  poor 
solubility  in  water-miscible  polar  solvents  precluded  its  study  in 
an  analogous  fashion. 

Together  these  results  indicate  that  H/D  exchange  reactions 
can  be  used  to  probe  for  dynamic  processes  that  perturb  the 
structure  of  what  is  otherwise  a  highly  compact  assembly.  While 
perhaps  related  to  the  dynamic  processes  that  symmetrize  the 
*H  NMR  spectra,  these  latter  “breathing”  dynamics  occur  on 
the  sub-millisecond  time  scale  and  are  thus  considerably  faster 
than  H/D  exchange.  Perturbations  in  the  metrical  parameters 
thus  correlate  well  to  the  longer  time  scale  processes  that  H/D 
exchange  reveals. 

3.  Self-Assembled  [2]-Catenanes  with  Rings  of  Differ¬ 
ent  Mass  (F4b).  3.1.  LC-MS/MS  Analyses  ofF4b  and  H4a  by 
LC-QTOF.  Previous  MS/MS  analysis  of  the  minor  [2] -catenane 
H4a  implied  that  it  was  composed  of  one  [lalalaH2]  ring 
and  a  symmetric  tetramer  with  alternating  la  and  H2  units, 
[la-H2-la-H2].7  This  suggestion  was  based  on  the  lack  of  a 
[h22+ H]+  dimeric  daughter  ion  (Scheme  2).  To  confirm  the 
structure,  we  investigated  [2] -catenane  F4b  since  it  was  the 
major  compound  in  the  corresponding  DSA  mixture.  LC-MS/ 
MS  analysis  on  purified  F4b  and  a  DSA  solution  containing  the 
catenane  F4b  (plus  others)  by  LC-QTOF  significantly 
improved  our  previous  experiments'  resolution,  sensitivity, 
and  mass  accuracy. 

For  both  samples,  the  doubly  charged  ion  ([■ F4b  +  2H]2+,  ml 
z  -  1588.7219)  was  selected  for  fragmention  on  the  basis  of  its 
high  abundance  under  the  electrospray  conditions.  Both 
analyses  showed  two  tetrameric  daughter  ions  ([ lb3F21  +  H]  + 


Scheme  2.  Possible  Daughter  Ions  for  the  Isomers  of  F4b 
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and  [ lb2F22  +  H]+),  three  trimeric  ions,  and  two  dimeric 
daughter  ions  ([lb2  +  H]+  and  [lblF2l  +  H]+)  with  good 
amplitude  (Figure  12  and  SI1  Figure  15).  The  key  dimeric 
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Figure  12.  (a)  Targeted  doubly  charged  ion  in  MSI  stage  for  LC-MS/ 
MS  analyses.  LC-QTOF  (CID  MS/MS)  spectra  of  the  isolated  F4b  at 
a  collision  energy  of  (b)  31.8,  (c)  41.8,  and  (d)  51.8  V.  The  y-axis  is 
normalized. 


daughter  ion  [F22  +  H]+  (Scheme  2),  which  only  occurs  in  the 
non-alternating  [lb2F22]  structure,  was  observed  in  both  cases, 
but  with  only  trace  intensity. 

Since  we  have  previously  demonstrated  that  hydrazones  are 
susceptible  to  ion— ion  crossover  in  the  mass  spectrometer,  it 
was  possible  that  this  fragment  formed  during  the  electrospray 
ionization  process.25  An  LC-MS/MS  analysis  of  the  DSA 
sample  containing  H4a  provided  the  same  net  result:  only  traces 
of  [ h22  +  H]+  ion  (SI1  Figure  16). 

To  investigate  the  possibility  that  the  traces  of  [F22  +  H]  + 
were  being  generated  through  ion-crossover  pathways,  an 
alternative  MS  approach  was  developed.  By  manipulating  the 
MS  parameters,  the  catenanes's  two  tetrameric  ions  could  be 
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generated  in  their  singly  charged  state  in  MS  stage  1  (Scheme 
3).  The  structures  of  these  ions  were  then  ascertained  by  a 


Scheme  3.  Possible  Daughter  Ions  for  the  Isomers  of 
Tetrameric  [lb2F22]  Macrocycle 
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secondary  fragmention  in  MS2.  If  the  selected  tetrameric  ring 
was  alternating  ((ii)  in  Scheme  3),  only  the  hetero- dimeric 
daughter  ion  would  be  observed,  while  structure  (i)  would 
provide  all  three  possible  dimeric  ions. 

Following  this  protocol,  a  pure  sample  of  F4b  was 
decomposed  into  its  two  tetrameric  singly  charged  ions 
([ lb3F2l  +  H]+  and  [lb2F22  +  H]+)  in  the  MSI  stage.  The 
key  tetrameric  ion  [lb2  22  +  H]+  was  then  selected  and 
additionally  fragmented.  Since  all  possible  dimeric  daughter 
ions  were  detected  (Figure  13  and  SI1  Figure  17),  these  data 
suggested  that  the  structure  of  F4b  was  in  fact  (i)  in  Scheme  2. 

A  similar  experiment  on  H4a,  that  is,  the  decomposition  of 
H4a  into  two  tetrameric  singly  charged  ions  ([la3H2l  +  H]+  and 
[la2"22  +  H]+)  in  the  MSI  stage  followed  by  the  selection  and 
fragmentation  of  the  key  tetrameric  ion  [  la2H22  +  H]+,  showed 
all  three  possible  dimeric  daughter  ions  ([ la2  +  H]+,  [la1H2l  + 
H]+,  and  [H22  +H]+),  although  the  key  homodimeric  ion  [H22  + 
H]+  was  of  low  intensity  (SI1  Figure  18).  We  therefore 
conclude  that  the  more  favorable  constitutional  isomers  of  H4a 
and  F4b  are  in  fact  (i)  in  Scheme  2,  and  that  tetramer 
fragmentation  to  [H22  +  H]+  is  unfavorable.  These  new  data 
support  a  correction  to  our  previous  assignment  as  (ii)  in 
Scheme  2.7 

3.2.  Solid-State  Analysis.  The  catenane  F4b  was  obtained  as 
single  crystals,  and  high-quality  X-ray  diffraction  data  were 
obtained.  The  octameric  [2] -catenane  consists  of  two  different 
interlocked  tetramers:  ring  1  is  composed  of  three  lb  and  one 
f2  units,  while  ring  2  contains  two  units  of  lb  and  F2  arranged 
in  a  non-alternating  manner  [laTa-F2-F2]  (Scheme  2  (i)  and 
Figure  14).  This  structure  confirmed  the  MS  data  and 
additionally  added  that  the  D-Pro-Ac6c  occupied  the  (3-1  and 
(3-2  turn  positions,  while  L-FPro-L-NaphGly  occupied  the  (3-3 
and  (3-4  positions. 

The  structure  of  F4b  is  similarly  flower-like,  but  compared  to 
F3b,  it  has  one  of  its  petal-like  loops  kinked  away  from  the 
pillar-like  central  region.  Each  tetrameric  macro  cycle  is  again 
saddle-shaped,  with  four  /Lturns  connected  by  an  extended  1,4- 
hydrazone/benzamide  linkage.  The  metrical  parameters  of  each 
/Lturn  indicate  that,  while  all  five  D-Pro-Ac6c  units  adopt  type 
IT  turns  and  the  two  L-FPro-L-NaphGly  units  in  the  core  adopt 
pseudo  type  VIII  or  type  VIII  turns  ((3-4  turns  in  ring  1  and  ring 
2,  respectively),  the  unique  L-FPro-L-NaphGly  unit  adopted  a 
type  I  turn  ((3-3  turn  in  ring  2)  (SI1  Table  7).  This  latter  turn  is 
that  most  commonly  encountered  in  L-Pro-L-X  structures.26 
Conversion  of  a  type  I  turn  to  the  type  VIII  required  for  (3-4 
requires  an  ~130°  rotation27  of  the  L-NaphGly  zT2  carbonyl 
(^1+2) •  This  carbonyl  is  then  positioned  to  participate  in  a  key 
inter-ring  HB  (SI1  Tables  7  and  9).  In  hindsight,  the  greater 


(a)  MS  1 

[1b5F23+2H]2+ 
1588.7110 
[7/>f,F23+H+Na]2+ 
1599.7022 

[fu/^+Hr 

1550.7232 


[7&5F23+2Na]2+ 
1610.6929 


1609.6901  [M+H]- 

1610.1914  1625.6951 
/  1610.6929  \1 626.6982 

Kl  61 1.1941  \\1 627.7001 


1615  1620  1625  m/z 


[1b2F22+H]+  [M+  H]+ 
1626.6982 
/  [1b2F22+ Na]+ 
1648.6794 


L 


1550 


(b)  32.5  V 


1575  1600  1625 

Counts  vs.  Mass-to-Charge  (m/z) 


1650 


176/2,+H] 

1181.5400 


[1/>if22+H]* 
1257.5074 

\  / 

.  ?  ■  I _ 


(c)  42.5  V 


[1b2+ H]+ 
737.3712 

i 

[fb,F2,+H]* 

813.3475 

.  1 

[f22+H]+ 

889.3239 

L 

730  760 

790  820 

850  880  m/z 

[M+  H]+ 
[7b2F22+H]+ 
1626.6928 


[1b,F22+ H]*-N2 
1229.5144 


1L 


[1b2F22+ H]*-H20 
1608.6836  > 


DB05O  \ 

i ll 


(d)  52.5  V [1b  F2  Hv-M -125  [Ibfc+HY-Wb  [1b2F22+ H]*-125 

1  1  z  J  z  1  -I  OO  /IOIO  _ _ 


[1b2F2,+ Hr-125\  J  • 

\  [1b/22+ H]*-N2 
\  1598.7011 

1 

1  1056.4442  v  | 

...  ,1  ,  ..NIL-  i.l 

.  >.\l 

600 

800  1000  1200 

1400  1600 

Counts  vs.  Mass-to-Charge  (m/z) 


Figure  13.  (a)  Tetrameric  singly  charged  ion  ( [lb2F22  +  H]+) 
generated  in  the  MSI  stage  for  LC-MS/MS  analyses.  LC-QTOF  (CID 
MS/MS)  spectra  of  the  targeted  tetrameric  ion  at  a  collision  energy  of 
(b)  32.5,  (c)  42.5,  and  (d)  52.5  V.  The  y-axis  is  normalized.  Two  filled 
circles  (green  and  pink)  indicated  two  trimeric  daughter  ions  ([lb2F21 
+  H]+  and  [lbF22  +  H]+),  respectively. 


adaptability  of  the  (3-1  and  / 3-3  turns  suggests  that  if  any  subunit 
were  to  be  replaced  by  a  less  optimal  version,  it  would  be  (3-1  or 
(3-3  (especially);  the  (3-2  and  (3-4  sites  would  be  the  least 
accommodating  to  a  non-optimal  replacement. 

Compared  to  the  identical  macrocycles  in  F3b,  ring  1  exhibits 
larger  torsional  angle  deviations  (SI1  Table  7).  While  the  core 
was  invariant  across  the  3  series,  the  (3-4  metrical  parameters  of 
F4b  deviate  from  F3b,  suggesting  significant  relay  of  remote 
structural  information  back  to  the  core,  and  an  alternative 
structural  solution  to  minimizing  the  overall  energy.  Although 
the  turns  in  the  core  deviate  from  those  in  F3b,  the  structure 
still  effectively  sandwiches  the  naphthyl  ring  between  the 
D-FPro  and  Ac6c  units  of  the  other  macrocycle  with  CH —n 
metrical  parameters  not  unlike  those  observed  in  the  3  series 
(SI1  Table  8).  Based  on  CH — n  distances,  the  interactions 
between  proline  and  NaphGly  in  each  ring  are  weaker  than  in 
the  C2-symmetry  analogue  F3b  but  still  stronger  than  those 
observed  in  H3a.  The  interactions  between  NaphGly  and  Ac6c 
in  each  ring  are,  however,  weaker  than  those  observed  in  both 
F3b  and  H3a,  which  is  likely  the  result  of  the  adjacent 
accommodating  (3-3  turn  of  an  unfavorable  turn  type  (type  I). 

Like  its  symmetric  analog  F3b,  F4b  has  six  turn-stabilizing 
hydrogen  bonds  in  (3-1 ,  (3-2 ,  and  (3-3  and  six  inter-ring  HBs 
(between  (3-2  and  (3-4  of  both  rings  and  between  the  (3-4  turns 
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Figure  14.  (a)  X-ray  structure  representations  of  F4b,  with  one  ring 
displayed  in  space-filling  and  the  second  inter-locked  ring  in  stick,  (b) 
Bottom  view  looking  down  onto  the  CH— n—  CH  sandwiches.  Among 
three  L-FPro-L-NaphGly  residues,  the  two  core  residues  form  CH— n 
interactions,  while  the  /?-3  loop  unit  does  not.  (c)  A  single  tetrameric 
ring  of  F4b  composed  of  three  lb  and  one  F2  units  (ring  1,  [lb3F21]). 
(d)  A  single  tetrameric  ring  of  F4b  composed  of  two  lb  and  two  F2 
units  (ring  2,  [lb2F22]).  Only  selected  hydrogen  atoms  are  displayed. 


of  each  ring)  (SI1  Table  9).  The  1,4-hydrazone/benzamide 
linkages  connecting  the  /Lturns  also  similarly  arrange  to  form 
an  aryl  barrel  (SI1  Table  10).  Like  catenane  F3dph,  which 
contains  weak  CH— n  interaction  units  (PhGly-Ac4c),  F4b  also 
displays  a  shortened  inter-ring  HBs  next  to  the  weak  CH —n 
moiety  (NaphGly-Ac6c).  Moreover,  this  effect  also  propagates 
to  the  adjacent  aryl  barrel  region  where  the  interfacial  (A— A') 
and  cross  barrel  (A—  C)  distances  also  shorten.  It  is  clear  that 
these  non-covalent  interactions  are  linked  phenomena. 

Of  course,  since  4  is  a  minor  product  in  most  DSA  reactions, 
the  combination  of  F-proline,  nap hthylgly cine,  and  Ac6c 
appears  to  be  uniquely  able  to  compensate  for  an  unfavorable 
unit  at  /?-3. 

3.3.  NMR  Analysis.  Like  F3b,  the  catenane  F4b  also  provided 
a  sharp  spectrum  with  well-dispersed  signals  in  pyridine-d5  (SI1 
Figure  19).  The  eight  hydrazone  NH’s  and  three  (doublet) 
amide  NH’s  (adjacent  to  CHa  in  the  L-NaphGly  residue)  were 
well  resolved  and  are  consistent  with  an  asymmetric  structure 
having  two  different  tetrameric  macro  cycles  and  three  L-Pro-L- 
NaphGly  residues.  The  lack  of  symmetry,  however,  made  for 
significant  complexity,  which  could  be  partially  overcome  by 
direct  comparison  to  the  symmetrical  and  similarly  well 
resolved  F3b  (7%  MeOH-d4/CDCl3).  Although  this  solvent 
choice  caused  partial  H/D  exchange  of  some  NH's,  the  spectra 
were  comparable  to  F3b. 

Since  catenane  F4b  has  three  F-substituted  L-proline  moieties 
and  only  two  are  engaged  in  CH —n  interactions,  the  two 
upfield-shifted  doublets  at  3.9  and  4.1  ppm  (4.0  ppm  in  the  *H 
NMR  spectrum  of  F3b)  were  assigned  to  the  geminal  H*  in 
each  of  the  /? -4  turns  (Figure  15a).  The  downfield  doublet  at 
5.0  ppm  (5.1  ppm  in  the  XH  NMR  spectrum  of  monomer  F2) 
was  assigned  to  H*  in  the  /?- 3  turn  fluoro-proline  of  ring  2;  19F- 
decoupled  *H  NMR  spectra  were  supportive  (Figure  15b). 
Using  the  upfield-shifted  H*  doublets  and  the  upfield-shifted 
CH's  in  the  Ac6c  as  an  anchor  point,  two  sets  of  the  F- 


Figure  15.  Portion  (S  =  3.2-5 .3  ppm)  of  the  400.0  MHz  lU  NMR 
spectra  of  F4b  at  20  °C  in  7%  MeOH-d4/CDCl3  (a)  coupled  with  19F 
and  (b)  decoupled  with  19F.  The  filled  circles  (red,  blue,  and  black) 
indicate  the  H*  signals  of  three  L-FPro  residues  described  in  the  insert 
picture.  A  portion  ( S  =  —176  to  —183  ppm)  of  the  400.0  MHz  19F 
NMR  spectra  of  F4b  at  20  °C  in  7%  MeOH-d4/CDCl3,  (c)  lU  coupled 
and  (d)  *H  decoupled. 


substituted  L-Pro  units  and  the  Ac6c  protons  involved  in  the 
CH— n  interactions  were  identified  by  TOCSY  (SI1  Figure 
20a).  Although  spectral  complexity  made  it  difficult  to  identify 
the  Naph  groups  involved  in  CH— n  interactions,  the  TOCSY 
spectrum  identified  six  of  the  eight  1,4-benzamide  groups, 
which  by  the  process  of  elimination  narrowed  the  Naph  region 
(SI1  Figure  20b).  The  HSQC  spectrum  identified  the  imine 
CH=N  (total  8)  and  three  CHa  resonances  (SI1  Figure  21 ). 

Based  on  these  data,  the  proline-naphthyl  and  Ac6c-naphthyl 
CH— n  interactions  could  be  deduced  from  the  ROESY 
spectrum  (SI1  Figure  23).  While  the  downfield  H*  assigned 
to  /?-3  lacked  cross-peaks  to  the  naphthyl  group,  several  upfield- 
shifted  proline  CH's  did  show  cross-peaks.  Similarly,  only  two 
sets  of  upheld  cyclohexyl  CH  resonances  corresponding  to  the 
two  inequivalent  Ac6c  CH's  displayed  such  cross-peaks  to  the 
Naph  groups  (SI1  Figure  23). 

Like  F3b  and  F3dph,  uniquely  upfield-shifted  resonances  for 
aryl  and  imine  CH’s  in  the  aryl  barrel  region  of  ring  1  and  ring 
2  were  also  noted  in  the  ROESY  spectrum,  this  time  doubled 
due  to  the  lack  of  symmetry  (SI1  Figure  24).  The  conservation 
of  these  and  numerous  other  close  contacts  across  the  3  series 
and  in  F4b  thus  implies  that  the  solid-state  structures  are 
preserved  in  solution. 

The  flexibility  and  dynamics  of  F4b  were  also  investigated 
through  H/D  exchange  reactions  in  pyridine-ds/D20.  Within 
3.4  days,  all  NH  groups  exchanged  and  the  three  CHas  next  to 
the  Naph  amide  NH’s  collapsed  into  the  broad  singlets  that  are 
indicative  of  vicinal  H/D  exchange  (SI1  Figure  25).  Overall, 
F4b  is  much  less  dynamic  and/or  accessible  to  H/D  exchange 
than  a  non-folded  monomer,  but  still  looser  and  more  dynamic 
than  the  symmetric  catenane  H3a. 
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■  DISCUSSION 

We  have  investigated  the  features  of  several  octameric  [2]- 
catenane  structures  derived  from  dipeptide  hydrazide  A— B 
monomers  in  the  solid  state,  along  with  solution  character¬ 
ization  through  high-resolution  NMR  spectroscopy  and  H/D 
exchange.  For  catenane  class  3,  the  structures  are  composed  of 
two  C2-related  interlocked  rings,  each  of  which  is  assembled 
from  three  units  of  1  and  one  unit  of  2.  By  contrast,  catenane 
class  4  comes  from  the  asymmetric  assembly  of  a  [1*1*  1*2]  and 
a  [  1-1  -2-2]  macrocyclic  unit.  In  all  cases  (class  3  and  4), 
monomer  2  is  of  the  L-Pro-L-ArylGly  stereochemistry,  and 
yields  for  the  catenane  were  maximized  when  the  aryl  group  is 
naphthyl,  though  phenyl  is  sufficient  to  template  self-assembly. 
The  family  of  catenanes  can  be  described  as  flower-like  with  a 
compact  core  where  the  aryl  of  the  arylglycine  is  pinched 
between  two  loops  of  the  saddle-shaped  tetrameric  ring  (Figure 
l).  The  key  core  formation  elements  are  CH— #  interactions 
provided  by  a  proline  on  one  side  and  the  methyl  of  an  Aib  on 
the  other.  Six  inter-ring  hydrogen  bonds  additionally  stabilize 
the  interior  core  structures.  From  this  highly  conserved  core 
emerge  petal-like  loops  that  terminate  in  /?-turns.  Each  Pro-X 
unit  adopts  a  standard  suite  of  /Tturns  (types  I,  F,  IT,  and  VIII) 
which  have  the  effect  of  reversing  the  chain  by  ~180°; 
deviations  from  180°  affect  the  spatial  projection  of  the  flat 
extended  hydrazone/benzamide  aromatic  units.  The  core  turns 
are  invariant  and  adopt  a  type  IF  turn  for  fi-2  and  a  type  VIII 
turn  for  [3-4,  while  the  peripheral  turns  (fj-l  and  fi- 3)  are  more 
variable  (types  II,  I,  and  F).  These  /Fturns  additionally  organize 
a  tightly  packed  aryl  barrel  with  little  free  volume  in  the  static 
structure,  though  transient  deformations  can  accommodate  aryl 
rotations  that  are  fast  on  the  NMR  time  scale.  It  is  in  this  region 
of  the  structure  that  the  two  macrocycles  thread  (Figure  16). 
The  barrel  is  characterized  by  an  array  of  slipped  face-to-face 
and  edge-to-face  ^-stacking  interactions20  and  resembles  the 
aromatic  clusters  observed  in  the  interior  of  proteins.28  Analysis 
of  these  structural  features  reveals  that  the  catenane  is  an 


Figure  16.  X-ray  structural  representations  of  F3dph  emphasizing  how 
the  benzamide  linkages  can  create  an  aromatic  pocket  for  binding  and 
threading  a  second  benzamide  chain. 


assembly  of  interwoven  non-covalent  interactions  and  that 
these  interactions  combine  to  maximally  stabilize  the  catenane. 
For  example,  when  the  core  CH— n  interactions  was  weakened 
by  substituting  a  naphthyl  for  an  aryl  /r-donor  (PhGly)  or  an 
unfavorable  turn  (type  I  /?-3  in  catenane  4),  reinforcing 
interactions  elsewhere  in  the  structures  compensated  for  the 
loss  of  stabilization  (contracted  inter-ring  hydrogen  bonds  or 
compressed  aromatic  interactions  in  the  aryl  barrel).  The 
following  article  examines  the  interrelated  nature  of  these 
interactions.14 

High-resolution  NMR  spectroscopy  suggests  that  the  solid- 
state  structures  are  largely  conserved  in  solution,  and  full 
assignments  were  achieved  in  several  cases.  H/D  exchange  in 
protic  D -solvents  were  also  informative,  showing  that  the  loop 
regions  undergo  H/D  exchange  at  relatively  fast  rates,  while  the 
interior  core  structures  are  much  less  dynamic.  In  one  case,  two 
key  amide  NH's  did  not  exchange  over  the  course  of  weeks, 
even  in  an  aggressive  solvent  (pyridine/D20).  We  take  these 
data  to  reflect  the  degree  of  motion  that  is  possible  in  the 
structure.  The  loops  which  are  more  flexible  and  exposed  are 
readily  deuterated,  while  the  interior  core  NH’s  are  constrained 
and  inflexible,  and  are  thus  less  prone  to  H/D  exchange. 
Comparisons  across  the  series  of  catenanes  reveal  that  where 
structural  deviations  occur  from  the  baseline  structure,  these 
distortions  correlate  with  higher  H/D  exchange  rates. 

The  /?- turns  are  also  important,  as  their  structures  act  to 
project  the  1,4-benzamide— hydrazone  linkers  that  create  the 
aromatic  binding  pocket.  This  binding  pocket,  which  can  be 
viewed  as  being  optimized  to  bind  an  identical  ring,  has  an 
opening  to  accommodate  the  entrance  of  a  suitable  guest 
(Figure  16).  Since  a  number  of  turn  types  are  available  in  a 
mixed  Pro-X  library,  one  can  envision  how  combinations  of 
turn  geometries  can  conspire  to  create  an  adaptable  class  of 
receptors  that  through  dynamic  unit  interchange  can  become 
optimized  for  guest-binding.5 

When  one  of  the  tetramers  is  computationally  excised  from 
the  interlinked  structure  previously  described,  the  reversals  of 
peptide  direction  achieved  by  the  /Fturns  become  clear.  The 
view  showing  these  linkers  in  Figure  16  is  additionally 
informative.  Note  how  the  benzamide  groups  of  the  aryl  barrel 
vertically  align  to  create  a  deep  aromatic  groove  that  is  open  on 
one  side  and  “binds”  the  threaded  benzamide  of  the  other  ring. 
Such  a  picture  suggests  that  non-catenated  tetramers  should  be 
capable  of  adopting  similar  conformers  that  present  deep 
aromatically  lined  pockets  for  the  recognition  of  flat  structures. 
Previous  work  by  us  and  others  has  shown  that  non-catenated 
tetramers  comprised  of  related  monomers  bind  guests  with 
aromatic  groups  like  protonated  nucleotides  and  cinchonidine 
salts.9c'd  The  current  results  suggest  a  mechanism  and  structure 
for  this  molecular  recognition. 

In  addition  to  displaying  a  benzamide  lined  binding  pocket, 
the  peptide  turns  provide  points  for  hydrogen  bond  donation 
and  acceptance  at  the  barrel's  periphery.  This  display  of  polar 
functionality  above  and  below  a  non-polar  interior  has  rough 
parallels  with  cyclodextrins,  calixarenes,  and  cucubiturils,  all 
well-established  molecular  receptors  29 

This  analysis  therefore  suggests  that  an  important 
component  of  the  binding  capability  of  these  structures 
comes  from  the  ability  to  present  an  aromatically  lined  cleft 
that  is  predisposed  for  binding.  One  presumes  that  such  a 
binding  site  could  accommodate  flat  arene  rings  via  n—n 
interactions  as  displayed  in  the  [2] -catenane  structures 
described  above,  but  could  also  utilize  strong  cation—#  and 
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perhaps  even  CH— n  interactions  to  bind  alternative  structural 
motifs.  The  display  of  polar  groups  above  and  below  this  site 
could  then  provide  secondary  or  tertiary  points  of  attachment 
as  the  [2]-catenanes  take  advantage  of  herein.  Although  we 
have  no  direct  evidence  for  this  binding  mode  in  any  DCC- 
selected  host— guest  complexes,  the  studies  described  here 
provide  our  first  structural  glimpses  into  the  mechanisms  of 
molecular  recognition  by  this  adaptable  class  of  compounds. 

■  CONCLUSION 

The  [2]-catenane  structures  described  herein  contain  many  of 
the  hallmarks  of  a  protein,  (l)  They  are  relatively  large  in  size 
(>3  kDa),  with  a  hydrodynamic  volume  that  is  large  enough  to 
benefit  from  NMR  pulse  sequences  designed  for  macro¬ 
molecules.  (2)  They  have  clear  components  of  secondary, 
tertiary,  and  quaternary  structure,  along  with  a  very  strong 
sequence-dependent  structure/stability  relationship.  (3)  Like 
proteins,  they  additionally  contain  distinct  regions  of  variable 
dynamic  behavior  (as  revealed  by  H/D  exchange).  Despite 
these  many  similarities,  however,  the  [2]-catenanes  simulta¬ 
neously  take  on  the  properties  of  small-molecule  receptors,  and 
are  thus  amenable  to  solid-state,  solution-state,  and  gas-phase 
characterization  tools.  Their  structures  can  be  readily 
manipulated  with  non-biogenic  amino  acids  to  test  the  effect 
of  electronic  and / or  steric  perturbations,  they  are  amenable  to 
relative  stability  determinations  via  their  self-assembly  proper¬ 
ties,  as  described  in  the  next  paper,14  and  they  self-assemble  in 
organic  solvents.  This  balance  of  physical  properties  positions 
them  at  a  bridge  point  between  proteins  and  small  molecules, 
and  enables  one  to  begin  investigating  how  diverse  non- 
covalent  interactions  acting  in  a  multi-valent  format  affect 
structure,  function,  and  stability  in  organic  solvents,  where  the 
balance  of  non-covalent  forces  differ  from  those  in  water.  These 
complex  three-dimensional  structures  provide  a  promising 
scaffold  for  pre-organization  of  functional  groups  needed  to 
create  enzyme-like  catalysts. 
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